Linear regression models are among the models most used in 
practice, although the practitioners are often not sure whether their 
assumed linear regression model is at least approximately true. In 
such situations, only designs for which the linear model can be checked 
are accepted in practice. For important linear regression models such 
as polynomial regression, optimal designs do not have this property. 
To get practically attractive designs, we suggest the following strat- 
egy. One part of the design points is used to allow one to carry out 
a lack of fit test with good power for practically interesting alterna- 
tives. The rest of the design points are determined in such a way that 
the whole design is optimal for inference on the unknown parameter 
in case the lack of fit test does not reject the linear regression model. 

To solve this problem, we introduce efficient lack of fit designs. 
Then we explicitly determine the $e_k$-optimal design in the class of 
efficient lack of fit designs for polynomial regression of degree $k - 1$. 

1. Introduction. Linear regression models are among the models most 
used in practice. Such a parametric assumption for the regression function 
is very attractive among practitioners, although they are often not sure 
whether their assumed linear regression model is at least approximately 
true. Therefore, if a design can be chosen (according to which the data 
are sampled), the practitioners spread out the design points over the whole 
experimental region. For important linear regression models such as polyno- 
mial regression, such designs and classical optimal designs are quite different. 
Even more seriously, when such an optimal design is used, deviations from 
the assumed polynomial regression model are not detectable. In this paper 
we address these concerns. 
To explain the above-mentioned problem in more detail, let us consider 
the linear regression model 

(1.1) $Y = X\theta + \varepsilon$, 

where $Y = (Y_1, \dots, Y_n)^\top$ is the vector of observations, $X$ is the 
design (model) matrix, $\theta \in \mathbb{R}^k$ is an unknown parameter 
vector and $\varepsilon = (\varepsilon_1, \dots, \varepsilon_n)^\top$ is the 
vector of errors. In this paper we assume that 
$\varepsilon_1, \dots, \varepsilon_n$ are independent, identically 
distributed real random variables with expectation 0 and unknown variance 
$\sigma^2 \in (0, \infty)$. Furthermore, we assume 
$X = X_n = (f(x_1), \dots, f(x_n))^\top$, where 
$(x_1, \dots, x_n) \in \mathcal{E}^n$ is a design for $n$ observations, 
$\mathcal{E} = [a, b] \subset \mathbb{R}$ is some compact interval and 
$f = (f_1, \dots, f_k)^\top : \mathcal{E} \to \mathbb{R}^k$ is a vector of 
known continuous regression functions that have bounded variation. 

It is common to solve classical experimental design problems for linear 
regression models of the form (1.1) in an approximate way. To do this, one 
identifies an arbitrary design with $n$ design points 
$(x_1, \dots, x_n) \in \mathcal{E}^n$ with the probability measure 
$\xi_n := \frac{1}{n} \sum_{i=1}^{n} \delta_{x_i}$, where $\delta_t$ is the 
Dirac measure in $t \in \mathcal{E}$. Then it is more feasible to solve a 
classical design problem in the set of all probability measures on 
$\mathcal{E}$ instead of in 
$\Xi_n := \{\frac{1}{n} \sum_{i=1}^{n} \delta_{x_i} \mid (x_1, \dots, x_n) \in \mathcal{E}^n\}$ 
or in $\bigcup_{n \in \mathbb{N}} \Xi_n$. If we are interested in inference 
on the parameter vector $K^\top \theta$, where 
$K \in \mathbb{R}^{k \times s}$, $1 \le s \le k$, $\mathrm{rank}(K) = s$, 
then, given the design $\xi_n$, the variance/covariance matrix of the best 
linear unbiased (least squares) estimator for $K^\top \theta$ is given by 

$\frac{\sigma^2}{n} K^\top M(\xi_n)^{-1} K, \qquad M(\xi) := \int_{\mathcal{E}} f f^\top \, d\xi$ 

[provided that $M(\xi_n)$ is invertible, otherwise we have to deal with 
generalized inverses]; see [17], page 65. Hence, the "quality" of statistical 
inference on the unknown parameter $K^\top \theta$ depends on the choice of 
the design $\xi_n$. The most interesting optimality criteria, such as the 
$\varphi_p$-criteria, are functions of the information matrix 
$C(\xi) := (K^\top M(\xi)^{-1} K)^{-1}$. By a famous theorem of 
Carathéodory (see, e.g., [22]), the optimal designs for a criterion that is 
a function of the information matrix can be determined in the set of 
probability measures with $k(k + 1)/2$ mass points instead of in the set of 
all probability measures. Indeed, the classical optimal designs for 
polynomial regression were determined in this set of probability measures. 
Therefore, in case the number $k$ of different regression functions is 
small (which is most interesting in practice), the optimal designs have only 
a small number of different design (mass) points. For instance, for 
polynomial regression of degree $k - 1$, the known classical A- and 
$e_k$-optimal [with $e_k = (0, \dots, 0, 1)^\top \in \mathbb{R}^k$] designs 
have only $k$ different design points. There is a huge literature on 
classical optimal designs in case model (1.1) is true; see, for example, 
[9, 12, 14, 17, 22]. 
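
As a small numerical illustration of the two formulas above, the following Python sketch computes $M(\xi)$ and the factor $e_k^\top M(\xi)^{-1} e_k$ for a design with finitely many support points in polynomial regression. The function names `info_matrix`, `solve` and `variance_factor` are illustrative choices and do not come from the paper; exact rational arithmetic is used only for convenience.

```python
from fractions import Fraction

def info_matrix(points, weights, k):
    """M(xi) = sum_i w_i f(x_i) f(x_i)^T for f(x) = (1, x, ..., x^(k-1))."""
    return [[sum(w * x ** (a + b) for x, w in zip(points, weights))
             for b in range(k)] for a in range(k)]

def solve(matrix, rhs):
    """Solve matrix * c = rhs by Gauss-Jordan elimination (exact for Fractions).
    Raises StopIteration if the matrix is singular."""
    n = len(rhs)
    aug = [list(row) + [Fraction(rhs[i])] for i, row in enumerate(matrix)]
    for col in range(n):
        pivot = next(r for r in range(col, n) if aug[r][col] != 0)
        aug[col], aug[pivot] = aug[pivot], aug[col]
        pivot_value = aug[col][col]
        aug[col] = [v / pivot_value for v in aug[col]]
        for r in range(n):
            if r != col:
                factor = aug[r][col]
                aug[r] = [v - factor * p for v, p in zip(aug[r], aug[col])]
    return [aug[i][n] for i in range(n)]

def variance_factor(points, weights, k):
    """e_k^T M(xi)^{-1} e_k: variance of the estimator of the highest
    coefficient, up to the factor sigma^2 / n."""
    e_k = [Fraction(0)] * (k - 1) + [Fraction(1)]
    return solve(info_matrix(points, weights, k), e_k)[-1]

# Quadratic regression (k = 3) on the support points -1, 0, 1.
pts = [Fraction(-1), Fraction(0), Fraction(1)]
equal_weights = variance_factor(pts, [Fraction(1, 3)] * 3, 3)                    # 9/2
chebyshev_weights = variance_factor(
    pts, [Fraction(1, 4), Fraction(1, 2), Fraction(1, 4)], 3)                    # 4
```

With weights 1/4, 1/2, 1/4 the factor is 4, compared with 9/2 for equal weights, which illustrates how strongly the choice of the design affects inference on the highest coefficient.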
To check models, Wiens [24] introduced optimal lack of fit- (LOF-) designs 
based on the power of a given lack of fit test. Wiens [24] considered the usual 
lack of fit test for regression models of the form (1.1). This approach was 
recently generalized by Biedermann and Dette [1]. 

It is of practical interest to develop (in some sense) optimal designs that 
offer the possibility to check whether the assumed model is true (or is at 
least not too far away from the true one) and with which one makes good 
inference on $\theta$ in case the assumed regression model is true. To 
establish designs which fulfill the above two postulates, we combine both 
ideas. For that, let $g : \mathcal{E} \to \mathbb{R}$ be the true but 
unknown regression function. Then the hypothesis 

(1.2) $H_0 : \exists\, \theta = (\theta_1, \dots, \theta_k)^\top \in \mathbb{R}^k$ with $g = f^\top \theta$ 

is of main practical interest. A test for this hypothesis is called a "lack 
of fit" test (LOF-test). Loosely speaking, our approach is then as follows. 
Let $r \in [0, 1]$ and let a certain lack of fit test be given. Then an 
optimal LOF-design has the "best" power for certain alternatives (of 
regression functions) that are separated from $H_0$; see Section 2.2. At 
first, we determine the class of $r$-efficient LOF-designs, that is, all 
designs for which the efficiency with respect to the optimal LOF-design is 
$r$. Hence, for $r = 1$, we obtain the optimal LOF-designs. Note that the 
known classical optimal designs for polynomial regression models, however, 
often belong to the class of 0-efficient LOF-designs. Next, in the class of 
LOF-designs which are at least $r$-efficient, the optimal design with 
respect to statistical inference on $\theta$ is determined under the 
assumption that the model (1.1) is true. In practice, the value $r$ of 
efficiency has to be chosen by the practitioner according to the statistical 
problem under consideration. Then the data are sampled according to the 
$r$-efficient-optimal LOF-design. In case model (1.1) is rejected by the 
LOF-test, the data are useless for inference on $\theta$. In case model 
(1.1) is not rejected, the data are used for inference on $\theta$. 

In Section 2 we compare our approach with other approaches that handle the 
problem under consideration. Furthermore, the class of $r$-efficient 
LOF-designs is established there. Then we determine $e_k$-optimal designs in 
the class of at least $r$-efficient LOF-designs for polynomial regression of 
degree $k - 1$ in Section 3. For corresponding results for a general linear 
regression model (1.1), see [15]. Moreover, we show the relation to Bayesian 
optimal designs. Proofs are postponed to the Appendix. 

2. LOF-designs. 

2.1. LOF-tests and an asymptotic aspect of LOF-designs. Let us again 
consider the assumed linear model (1.1), 
$Y = X\theta + \varepsilon = (f(x_{n1}), \dots, f(x_{nn}))^\top \theta + \varepsilon$, 
the true model, $Y = (g(x_{n1}), \dots, g(x_{nn}))^\top + \varepsilon$, and 
the hypothesis (1.2), 
$H_0 : \exists\, \theta = (\theta_1, \dots, \theta_k)^\top \in \mathbb{R}^k$ 
with $g = f^\top \theta$. Many LOF-tests are possible for $H_0$. Wiens [24] 
considered the usual LOF-test, whose test statistic has a noncentral 
F-distribution if the errors are normally distributed. This holds 
asymptotically as well under mild conditions; see [24]. If the regression 
function and the density of the design $\xi$ fulfill some smoothness 
assumptions, then tests based on nonparametric estimation of the regression 
function are possible. Biedermann and Dette [1] proposed three LOF-tests of 
this kind. Every test mentioned above has an asymptotic (for $\xi_n$ 
converging in some sense to $\xi$, as $n \to \infty$) power, which is a 
function of 

$B(g, \xi) := \frac{1}{\sigma^2} \int_{\mathcal{E}} \bigl( (\mathrm{pr}^{L^2(\xi)}_{[f_1, \dots, f_k]^\perp} g)(x) \bigr)^2 \, \xi(dx)$, 

where $\mathrm{pr}^{L^2(\xi)}_{[f_1, \dots, f_k]^\perp}$ is the orthogonal 
projector onto $[f_1, \dots, f_k]^\perp$ in $L^2(\xi)$; see [1, 24]. The 
greater $B(g, \xi)$ is, the greater is the asymptotic power. It is easy to 
argue that we have to put $B(g, \xi) := \infty$ for $g \notin L^2(\xi)$. In 
practice, however, it is sufficient to consider regression functions that 
have bounded variation. Additionally, we assume that the regression function 
is regular, which here means that $g$ is continuous from the left in $b$ and 
continuous from the right in $[a, b)$. We denote this class of functions by 
$BV(\mathcal{E}) = BV[a, b]$. In the sequel we assume that the true 
regression function $g$ is an element of $BV(\mathcal{E})$. 

In contrast to the approximation approach for the classical design problem, 
an asymptotic statistical argument is decisive for the set of probability 
measures in which the LOF-design problem should be solved. Bischoff [2, 3] 
and Bischoff and Miller [4] consider partial sum processes of regression 
models. These papers imply that a regression function 
$g \in BV(\mathcal{E})$ which does not belong to the assumed regression 
model can be detected asymptotically if the sequence of exact designs 
converges uniformly to an asymptotic design (probability measure) that has 
an absolutely continuous part with positive density which belongs to 
$BV(\mathcal{E})$. Uniform convergence of probability distributions means 
uniform convergence of the corresponding distribution functions. Therefore, 
as a class of interesting designs, we consider the set $\Xi$ of probability 
measures on $\mathcal{E}$ that can be decomposed into a finitely discrete 
part and an absolutely continuous part with respect to the Lebesgue measure 
whose density belongs to $BV(\mathcal{E})$. Moreover, the papers mentioned 
above additionally show that $B(g, \xi)$ is a suitable measure for how well 
an alternative regression function can be detected. Indeed, 
$\sqrt{B(g, \xi)}$ is the norm of the residual sum limit process with 
respect to the reproducing kernel Hilbert space of the corresponding limit 
process. Before we continue, we have to approximate $\xi \in \Xi$ by a 
design in $\Xi_n$ if $n$ is the fixed number of observations. To this end, 
let $F_\xi(t) := \xi((-\infty, t])$, $t \in \mathcal{E}$, be the 
distribution function of $\xi$, and let $Q_\xi$ be the right continuous 
inverse of $F_\xi$. Then the design 
$\xi_n = \frac{1}{n} \sum_{i=1}^{n} \delta_{x_{ni}}$ with 

(2.1) $x_{n,i+1} := Q_\xi(i/(n - 1)), \quad i = 0, \dots, n - 1$, 

has the property that $\xi_n$ converges uniformly to $\xi$. These designs 
correspond to designs defined by Sacks and Ylvisaker [20]. 
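
The construction (2.1) can be sketched in Python as follows. This is a minimal illustration, assuming that the distribution function of the design is available as a Python callable and that the right continuous inverse is taken as $Q(u) = \inf\{t : F(t) > u\}$ (with value $b$ if this set is empty). The helper names `right_continuous_inverse` and `exact_design` and the bisection tolerance are hypothetical choices.

```python
def right_continuous_inverse(cdf, u, a, b, tol=1e-12):
    """Q(u) = inf{t in [a, b] : cdf(t) > u}, or b if no such t exists.
    Uses bisection, which is valid because cdf is nondecreasing."""
    if cdf(b) <= u:
        return b
    if cdf(a) > u:
        return a
    lo, hi = a, b  # invariant: cdf(lo) <= u < cdf(hi)
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if cdf(mid) > u:
            hi = mid
        else:
            lo = mid
    return hi

def exact_design(cdf, n, a, b):
    """Design points x_{n,i+1} = Q(i / (n - 1)), i = 0, ..., n - 1 (needs n >= 2)."""
    return [right_continuous_inverse(cdf, i / (n - 1), a, b) for i in range(n)]

# Uniform design on [-1, 1]: the points are (numerically) equally spaced.
uniform_points = exact_design(lambda t: (t + 1) / 2, 5, -1.0, 1.0)
```

For a design with point masses the same routine places repeated observations at the atoms, since the distribution function jumps there.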

There are several other approaches that handle the problem under 
consideration. For polynomial regression of fixed degree, for instance, 
Dette [8] and Pukelsheim and Rosenberger [18] consider designs for a 
polynomial of higher degree (as an alternative) and a mixture of two 
optimality criteria. Box and Draper [7] (see also [11]) look for designs 
minimizing a bias. Montepiedra and Yeh [16] use a sequential approach. 
Biswas and Chaudhuri [6] select the correct model from a known finite family 
of nested linear models and estimate the parameters associated with that 
model. But the optimal designs of these approaches do not have the property 
that each fixed alternative of the class mentioned above can be discovered 
as $n \to \infty$. 



2.2. Optimal LOF-designs. For the definition of LOF-efficiency, it is 
technically necessary to consider a subset of alternatives which is 
separated from the hypothesis $H_0$. Following Biedermann and Dette [1], let 
$v \in BV(\mathcal{E})$ be a weight function which gives more weight to 
those points of $\mathcal{E}$ for which a deviation is more serious, and let 
$\lambda$ be Lebesgue measure. Then we consider 
$\{h \in BV(\mathcal{E}) \mid \int_{\mathcal{E}} h^2 v \, d\lambda \ge c, \ \int_{\mathcal{E}} f h v \, d\lambda = 0_k\}$, 
$c > 0$ fixed, as a set of alternatives which is separated from the 
hypothesis $H_0$. Let $\bar\lambda$ be the uniform distribution on 
$\mathcal{E}$, that is, 
$\bar\lambda = (\lambda(\mathcal{E}))^{-1} \cdot \lambda|_{\mathcal{E}}$. 
By choosing $v : \mathcal{E} \to [0, \infty)$ with 

(2.2) $\int_{\mathcal{E}} v \, d\bar\lambda = 1$ 

and $c > 0$ suitably, the above set of alternatives can be written as 

$\mathcal{F}_{v,c} := \{h \in BV(\mathcal{E}) \mid \int_{\mathcal{E}} h^2 v \, d\bar\lambda \ge c, \ \int_{\mathcal{E}} f h v \, d\bar\lambda = 0_k\}$. 

In the sequel let $\mathcal{F}_{v,c}$ be fixed and let $v$ satisfy (2.2). 
Following Wiens [24], we continue with a maximin approach. 



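Definition 2.1. (a) [24]. A design $\xi_0 \in \Xi$ is called LOF-optimal if 

$\max_{\xi \in \Xi} \min_{h \in \mathcal{F}_{v,c}} B(h, \xi) = \min_{h \in \mathcal{F}_{v,c}} B(h, \xi_0)$. 

(b) The relative LOF-efficiency of a design $\xi \in \Xi$ is 

$\mathrm{Eff}_{\mathrm{LOF}}(\xi) = \min_{h \in \mathcal{F}_{v,c}} B(h, \xi) \Big/ \min_{h \in \mathcal{F}_{v,c}} B(h, \xi_0) \in [0, 1]$, 

where $\xi_0$ is an optimal LOF-design. 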
2.3. Efficient LOF-designs. Wiens [24] computed optimal LOF-designs 
for $v = \mathrm{const}$. Biedermann and Dette [1] generalized this result to 
arbitrary $v$. These papers imply that the optimal LOF-design is the 
probability measure $v \cdot \bar\lambda$, where for a measure $\eta$ 
defined on the Borel field $\mathcal{B}$ and a Borel-measurable function 
$w : \mathbb{R} \to [0, \infty)$ the measure $w \cdot \eta$ is defined by 
$(w \cdot \eta)(A) := \int_A w \, d\eta$, $A \in \mathcal{B}$. Next we give a 
generalization of this result. We use for two measures $\mu_1, \mu_2$ on 
$\mathcal{B}$ the notation 
$\mu_1 \le \mu_2 :\Leftrightarrow \forall B \in \mathcal{B} : \mu_1(B) \le \mu_2(B)$. 

Theorem 2.2. The set of designs with relative LOF-efficiency greater 
than or equal to $r \in [0, 1]$ is given by 
$\Upsilon_v[r] := \{\xi \in \Xi \mid r v \cdot \bar\lambda \le \xi\}$. 

Given $\xi \notin \Upsilon_v[r]$, the main part of the proof is to construct 
a regression function $h_0 \in \mathcal{F}_{v,c}$ with 
$(\mathrm{Eff}_{\mathrm{LOF}}(\xi) \le)\ B(h_0, \xi) / \min_{h \in \mathcal{F}_{v,c}} B(h, v \cdot \bar\lambda) < r$. 
The proof is mainly along the lines of Wiens [24] and Biedermann and Dette 
[1] and therefore is omitted. An immediate consequence of Theorem 2.2 is the 
following interesting corollary. 

Corollary 2.3. Let $\xi \in \Xi$ and 
$r := \sup\{t \mid t v \cdot \bar\lambda \le \xi\} \in [0, 1]$. Then the 
LOF-efficiency of $\xi$ is equal to $r$. 
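
For a design whose absolutely continuous part has density $d$ with respect to $\bar\lambda$, the supremum in Corollary 2.3 equals the essential infimum of $d/v$ over $\{v > 0\}$, because finitely many point masses cannot dominate $v \cdot \bar\lambda$ on sets that avoid them. The following Python sketch approximates this value by a minimum over a grid. It assumes that $d$ and $v$ are continuous, so that a grid minimum is a reasonable proxy; the function name `lof_efficiency` and the grid size are hypothetical choices.

```python
def lof_efficiency(density, weight, a=-1.0, b=1.0, grid=2001):
    """Approximate r = sup{t : t * v * lambda_bar <= xi} by min(density / weight).

    density: density of the absolutely continuous part of xi with respect
             to the uniform distribution on [a, b]
    weight:  the weight function v, normalized as in (2.2)
    Grid points with weight 0 impose no constraint and are skipped.
    """
    ratios = []
    for i in range(grid):
        x = a + (b - a) * i / (grid - 1)
        v = weight(x)
        if v > 0:
            ratios.append(density(x) / v)
    return min(1.0, min(ratios))

# Uniform design (density 1) against the weight v(x) = 1.5 * (1 - x^2):
# the ratio 1 / v is smallest at x = 0, which gives efficiency 2/3.
r_uniform = lof_efficiency(lambda x: 1.0, lambda x: 1.5 * (1 - x * x))
```

A design of the form $r v \cdot \bar\lambda$ plus point masses has the constant ratio $r$, in line with Theorem 2.2.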

3. $e_k$-optimal designs in $\Upsilon_v[r]$ for polynomial regression. In 
this section we calculate $e_k$-optimal designs in $\Upsilon_v[r]$, where 
$v$ is an arbitrary weight function, in case the parametric model (1.1) is 
the polynomial regression model of degree $k - 1$, that is, 
$f(x) = (1, x, \dots, x^{k-1})^\top$, $k \ge 2$, where 
$e_k = (0, \dots, 0, 1)^\top \in \mathbb{R}^k$. As discussed in [15], we can 
consider the experimental region $\mathcal{E} = [-1, 1]$ for 
$e_k$-optimality, without loss of generality. Furthermore, the $e_k$-optimal 
design in $\Upsilon_v[r]$ for polynomial regression is unique; see also 
[15]. The proof of the following theorem can be found in the Appendix. 

Theorem 3.1. A design $\xi \in \Upsilon_v[r]$ is optimal for 
$e_k^\top \theta$ in $\Upsilon_v[r]$ if and only if 

(3.1) $\forall y \in [-1, 1],\ \forall z \in \{z \in [-1, 1] \mid \xi(\{z\}) > 0\} :\quad (e_k^\top M(\xi)^{-1} f(y))^2 \le (e_k^\top M(\xi)^{-1} f(z))^2$. 

Moreover, the optimal design for $e_k^\top \theta$ in $\Upsilon_v[r]$ has 
the form $r v \cdot \bar\lambda + (1 - r) \sum_{i=1}^{\ell} p_i \delta_{t_i}$, 
where $t_1, \dots, t_\ell \in [-1, 1]$ are $\ell\ (\le k)$ different points 
with 
$|e_k^\top M(\xi)^{-1} f(t_i)| = \max_{y \in [-1, 1]} |e_k^\top M(\xi)^{-1} f(y)|$ 
and $p_1, \dots, p_\ell \in (0, 1]$ are suitable values with 
$\sum_{i=1}^{\ell} p_i = 1$. 
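
Condition (3.1) is easy to check numerically for a candidate design. The sketch below does this for quadratic regression ($k = 3$) with $v = 1$ and a design consisting of $r$ times the uniform distribution on $[-1, 1]$ plus point masses. The helper names are hypothetical, the inequality is only tested on a finite grid, and the candidate weights $(1/4 - r/6,\ 1/2 - 2r/3,\ 1/4 - r/6)$ at $-1, 0, 1$ are an assumption of this illustration (they correspond to the quadratic case with $v = 1$).

```python
from fractions import Fraction

def solve(matrix, rhs):
    """Solve matrix * c = rhs by Gauss-Jordan elimination (exact for Fractions)."""
    n = len(rhs)
    aug = [list(row) + [Fraction(rhs[i])] for i, row in enumerate(matrix)]
    for col in range(n):
        pivot = next(r for r in range(col, n) if aug[r][col] != 0)
        aug[col], aug[pivot] = aug[pivot], aug[col]
        pivot_value = aug[col][col]
        aug[col] = [v / pivot_value for v in aug[col]]
        for r in range(n):
            if r != col:
                factor = aug[r][col]
                aug[r] = [v - factor * p for v, p in zip(aug[r], aug[col])]
    return [aug[i][n] for i in range(n)]

def mixed_info_matrix(r, atoms, k):
    """M(xi) for xi = r * (uniform on [-1, 1]) + sum of p * delta_t over atoms (t, p)."""
    def moment(m):
        # m-th moment of the uniform distribution on [-1, 1]
        uniform = Fraction(1, m + 1) if m % 2 == 0 else Fraction(0)
        return r * uniform + sum(p * t ** m for t, p in atoms)
    return [[moment(a + b) for b in range(k)] for a in range(k)]

def satisfies_condition(r, atoms, k, grid=200):
    """Check (3.1) on a grid of y values in [-1, 1] for every atom z of the design."""
    e_k = [Fraction(0)] * (k - 1) + [Fraction(1)]
    c = solve(mixed_info_matrix(r, atoms, k), e_k)  # c = M(xi)^{-1} e_k
    d2 = lambda y: sum(cj * y ** j for j, cj in enumerate(c)) ** 2
    smallest_at_atoms = min(d2(t) for t, _ in atoms)
    ys = [Fraction(i, grid) - 1 for i in range(2 * grid + 1)]
    return all(d2(y) <= smallest_at_atoms for y in ys)

r = Fraction(1, 2)
candidate = [(Fraction(-1), Fraction(1, 4) - r / 6),
             (Fraction(0), Fraction(1, 2) - 2 * r / 3),
             (Fraction(1), Fraction(1, 4) - r / 6)]
ok = satisfies_condition(r, candidate, 3)
```

For this candidate, $e_3^\top M(\xi)^{-1} f(y)$ is a multiple of $2y^2 - 1$, so its absolute value is maximal exactly at $-1, 0, 1$. Moving all point mass to the endpoints violates the condition at $y = 0$.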

Note that our design problem is related to a Bayesian design problem. 
For that, let $r v \cdot \bar\lambda + (1 - r) \zeta^*$ be the 
$e_k^\top \theta$-optimal design in $\Upsilon_v[r]$ and let 
$M_0 := M(v \cdot \bar\lambda)$. Then $\zeta^*$ is the Bayesian optimal 
design for $e_k^\top \theta$ of the Bayesian design problem with a priori 
information $M_0$. This means that $\zeta^*$ maximizes the information 
matrix $r M_0 + (1 - r) M(\zeta)$ with respect to $\zeta \in \Xi$; 
see [10], Section 5, [8], [17], Chapter 11, and [15]. 

Kiefer and Wolfowitz [13] showed that 
$\frac{1}{2(k-1)} (\delta_{-1} + \delta_1) + \frac{1}{k-1} \sum_{j=1}^{k-2} \delta_{\cos(\pi (k-1-j)/(k-1))}$ 
is an $e_k$-optimal design in $\Xi$; see also [23]. The support points of 
this optimal design are the extremal points in $[-1, 1]$ of the Chebyshev 
polynomial $T_{k-1}$ of degree $k - 1$. See [19] and [21] for properties of 
Chebyshev polynomials $T_n(x)$, $n \in \mathbb{N}_0$, $x \in [-1, 1]$. The 
proof of the following main result can be found in the Appendix. 

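Theorem 3.2. Let $\nu_i = 1/2$ if $i \in \{0, k - 1\}$, $\nu_i = 1$ if 
$i \in \{1, \dots, k - 2\}$, let $p_i = \frac{\nu_i}{k-1} - r q_i$, 
$i = 0, \dots, k - 1$, where 

$q_i = \frac{\nu_i}{2(k-1)} \sum_{j=0}^{2k-3} \cos\Bigl(\frac{j (k-1-i) \pi}{k-1}\Bigr) \int_{-1}^{1} T_j(x) v(x) \, dx$, 

and let 

(3.2) $\alpha_0^{(k)} := \min\{\nu_i / ((k - 1) q_i) \mid i = 0, \dots, k - 1 \text{ with } q_i > 0\}$. 

Then for $r \in [0, \alpha_0^{(k)}]$, the design 
$\xi = r v \cdot \bar\lambda + \sum_{i=0}^{k-1} p_i \delta_{\cos(\pi (k-1-i)/(k-1))}$ 
is $e_k$-optimal in $\Upsilon_v[r]$. 

Next let us consider this result for two special weight functions. 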

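Example 3.3. For the weight function $v(x) = 2/(\pi \sqrt{1 - x^2})$, 
Theorem 3.2 can be simplified substantially. Since 

$\int_{-1}^{1} T_j(x) v(x) \, dx = \frac{2}{\pi} \int_{-1}^{1} T_j(x) \frac{dx}{\sqrt{1 - x^2}} = \begin{cases} 2, & \text{if } j = 0, \\ 0, & \text{otherwise} \end{cases}$ 

(see [19], page 35), we get $p_i = (1 - r) \frac{\nu_i}{k-1}$ and 
$\alpha_0^{(k)} = 1$, where $\nu_i = 1/2$, $i \in \{0, k - 1\}$, 
$\nu_i = 1$, $i \in \{1, \dots, k - 2\}$. Hence, for arbitrary 
$r \in [0, 1]$, the $e_k$-optimal design in $\Upsilon_v[r]$ is 

$\xi = r v \cdot \bar\lambda + \frac{1 - r}{k - 1} \Bigl( \frac{1}{2} (\delta_{-1} + \delta_1) + \sum_{j=1}^{k-2} \delta_{\cos(\pi (k-1-j)/(k-1))} \Bigr)$. 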

Example 3.4. Finally, we specialize Theorem 3.2 for $v = 1$ by using 
standard results for Chebyshev polynomials. Let $\nu_i = 1/2$ for 
$i \in \{0, k - 1\}$, let $\nu_i = 1$ for $i \in \{1, \dots, k - 2\}$, let 

$p_i = p_{k-1-i} = \frac{\nu_i}{k-1} - r q_i, \qquad q_i = \frac{\nu_i}{2(k-1)} \sum_{j=0}^{k-2} \cos\Bigl(\frac{2 j i \pi}{k-1}\Bigr) \Bigl( \frac{1}{2j+1} - \frac{1}{2j-1} \Bigr)$ 

for $i = 0, \dots, k - 1$, and let 
$\alpha_0^{(k)} = \min\{\nu_i / ((k - 1) q_i) \mid i = 0, \dots, k - 1 \text{ with } q_i > 0\}$. 
Then for $r \in [0, \alpha_0^{(k)}]$ arbitrarily fixed, the design 
$\xi = r \bar\lambda + \sum_{i=0}^{k-1} p_i \delta_{\cos(\pi (k-1-i)/(k-1))}$ 
is $e_k$-optimal in $\Upsilon_v[r]$. In Table 1 we state the values 
$\alpha_0^{(k)}$ for $k = 2, 3, \dots, 8$. Next, we state the optimal 
designs in $\Upsilon[r] := \Upsilon_v[r]$ for polynomial regression of 
certain degrees. 

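(a) Straight-line regression, that is, $k = 2$. Let $r \in [0, 1]$ be 
arbitrarily fixed. Then the design 
$\xi = r \bar\lambda + (1 - r) \frac{1}{2} (\delta_{-1} + \delta_1)$ is 
$e_2$-optimal in $\Upsilon[r]$. 

(b) Quadratic regression, that is, $k = 3$. Let $r \in [0, 3/4]$ be 
arbitrarily fixed. Then the design 
$\xi = r \bar\lambda + (\frac{1}{4} - \frac{r}{6}) (\delta_{-1} + \delta_1) + (\frac{1}{2} - \frac{2r}{3}) \delta_0$ 
is $e_3$-optimal in $\Upsilon[r]$. 

(c) Cubic regression, that is, $k = 4$. Let $r \in [0, 5/6]$ be arbitrarily 
fixed. Then the design 
$\xi = r \bar\lambda + (\frac{1}{6} - \frac{r}{10}) (\delta_{-1} + \delta_1) + (\frac{1}{3} - \frac{2r}{5}) (\delta_{-1/2} + \delta_{1/2})$ 
is $e_4$-optimal in $\Upsilon[r]$. 

(d) Polynomial regression of degree 4, that is, $k = 5$. Let 
$r \in [0, 105/136]$ be arbitrarily fixed. Then the design 
$\xi = r \bar\lambda + (\frac{1}{8} - \frac{r}{14}) (\delta_{-1} + \delta_1) + (\frac{1}{4} - \frac{4r}{15}) (\delta_{-1/\sqrt{2}} + \delta_{1/\sqrt{2}}) + (\frac{1}{4} - \frac{34r}{105}) \delta_0$ 
is $e_5$-optimal in $\Upsilon[r]$. 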
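
The bounds of Example 3.4 can be recomputed with a few lines of Python. The sketch below assumes $v = 1$ and reads the definition in Theorem 3.2 as $q_i = \frac{\nu_i}{2(k-1)} \sum_{j=0}^{2k-3} \cos(j(k-1-i)\pi/(k-1)) \int_{-1}^{1} T_j(x)\,dx$; this reading, as well as the function names, are assumptions of the illustration. It uses the standard fact that $\int_{-1}^{1} T_j(x)\,dx$ is $0$ for odd $j$ and $2/(1 - j^2)$ for even $j$.

```python
import math

def chebyshev_integral(j):
    """Integral of T_j over [-1, 1]: 0 for odd j, 2 / (1 - j^2) for even j."""
    return 0.0 if j % 2 else 2.0 / (1.0 - j * j)

def nu(i, k):
    """nu_i = 1/2 for the two endpoints, 1 for the interior support points."""
    return 0.5 if i in (0, k - 1) else 1.0

def q_weights(k):
    """q_0, ..., q_{k-1} for the weight function v = 1 (requires k >= 2)."""
    weights = []
    for i in range(k):
        theta = (k - 1 - i) * math.pi / (k - 1)
        total = sum(math.cos(j * theta) * chebyshev_integral(j)
                    for j in range(2 * k - 2))
        weights.append(nu(i, k) / (2 * (k - 1)) * total)
    return weights

def alpha0(k):
    """Largest r for which all p_i = nu_i / (k - 1) - r * q_i stay nonnegative."""
    q = q_weights(k)
    return min(nu(i, k) / ((k - 1) * q[i]) for i in range(k) if q[i] > 0)

bounds = {k: alpha0(k) for k in range(2, 9)}
```

Under these assumptions the routine gives 1, 3/4, 5/6 and 105/136 for $k = 2, 3, 4, 5$, in agreement with cases (a) to (d), and the remaining values can be compared with Table 1.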

It is worth mentioning that in [5] the case that $r$ is near 1 is 
considered. For this case the $e_k$-optimal designs in $\Upsilon[r]$ can 
also be explicitly calculated for $r \in [\alpha_1^{(k)}, 1]$, where 
$\alpha_1^{(k)}$ is some bound that can be calculated. By this result, for 
each $r \in [0, 1]$ the $e_k$-optimal designs in $\Upsilon[r]$ can be 
calculated for $k = 3, 4$. 

APPENDIX: PROOFS OF THEOREMS 3.1 AND 3.2 

We first state an equivalence theorem which is useful for explicitly 
calculating optimal designs in $\Upsilon_v[r]$. Since 
$\{M(\xi) \mid \xi \in \Upsilon_v[r]\}$ is convex and compact, the proof of 
the following equivalence theorem is related to equivalence theorems stated 
in the literature; see, for example, [17]. But note that we consider 
arbitrary designs of $\Xi$ and not only designs with finite support. 
For details, see [15]. 

Theorem A.1. A design $\xi \in \Upsilon_v[r]$ is $\varphi_p$-optimal 
[$p \in (-\infty, 1)$] for $K^\top \theta$ in $\Upsilon_v[r]$ if and only if 
the inequality 

(A.1) $\forall y \in \mathcal{E}\ \forall z \in S : f(y)^\top N f(y) \le f(z)^\top N f(z)$ 

holds, where 
$S := \{z \in \mathcal{E} \mid \forall \epsilon > 0 : (\xi - r v \cdot \bar\lambda)((z - \epsilon, z + \epsilon) \cap \mathcal{E}) > 0\}$ 
and 
$N = M(\xi)^{-1} K (K^\top M(\xi)^{-1} K)^{-p-1} K^\top M(\xi)^{-1}$. 

Table 1 
$\alpha_0^{(k)}$ of Example 3.4 

  k                  2    3      4        5        6        7        8 
  $\alpha_0^{(k)}$   1    0.75   0.8333   0.7721   0.7980   0.7755   0.7882 

Proof of Theorem 3.1. We consider Theorem A.1 with 
$\mathcal{E} = [-1, 1]$, $p = -1$ and $K = e_k$. Then we have 
$N = M(\xi)^{-1} e_k e_k^\top M(\xi)^{-1}$ and 
$f(x)^\top N f(x) = (e_k^\top M(\xi)^{-1} f(x))^2$ for all 
$x \in [-1, 1]$. Note that $e_k^\top M(\xi)^{-1} f(y)$ is a polynomial of 
degree $k - 1$ since $M(\xi)^{-1}$ is positive definite. Hence, this 
polynomial has at most $k$ extremal points in $[-1, 1]$. Thus, the assertion 
follows. □ 

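In the proof of Theorem 3.2 we use the following lemma. 

Lemma A.2. Let $k \ge 2$, let 
$F = ((x_{i-1})^{j-1})_{j,i=1,\dots,k} \in \mathbb{R}^{k \times k}$, where 
$x_i = \cos\bigl(\frac{(k-1-i)\pi}{k-1}\bigr)$, $i = 0, \dots, k - 1$, and 
let $S = \mathrm{diag}(\dots, -1, 1, -1, 1)$. Then we have 

$\forall x \in [-1, 1] : 1_k^\top S F^{-1} f(x) = T_{k-1}(x)$. 

Proof. The function $x \mapsto 1_k^\top S F^{-1} f(x)$ is a polynomial of 
degree at most $k - 1$. We have 

$1_k^\top S F^{-1} f(x_i) = 1_k^\top S e_{i+1} = (-1)^{k-1-i} = \cos((k - 1 - i)\pi)$ 
$= \cos((k - 1) \arccos(x_i)) = T_{k-1}(x_i), \quad i = 0, \dots, k - 1$. 

Since a polynomial of degree at most $k - 1$ is determined by its values at 
$k$ different points, the assertion follows. □ 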

Proof of Theorem 3.2. First note that $r \in [0, \alpha_0^{(k)}]$ is a 
natural condition because otherwise $\xi$ is not a measure any more. Let 
$x_i = \cos\bigl(\frac{(k-1-i)\pi}{k-1}\bigr)$, $i = 0, \dots, k - 1$, be 
defined as in Lemma A.2 and let $p_i^* := \frac{\nu_i}{k-1}$. Then 
$\xi^* := \sum_{i=0}^{k-1} p_i^* \delta_{x_i}$ is an $e_k$-optimal design in 
$\Xi$. Hence, 

$\forall y \in [-1, 1]\ \forall i \in \{0, 1, \dots, k - 1\} : (e_k^\top M(\xi^*)^{-1} f(y))^2 \le (e_k^\top M(\xi^*)^{-1} f(x_i))^2$. 

Thus, to show Theorem 3.2 it is sufficient by Theorem 3.1 to show, for some 
$\gamma \in \mathbb{R}$, 

(A.2) $M(\xi)^{-1} e_k = \gamma M(\xi^*)^{-1} e_k$. 

Let $F$, $S = \mathrm{diag}(\dots, -1, 1, -1, 1)$ be defined as in Lemma A.2, 
let $R = \int_{-1}^{1} f f^\top v \, d\bar\lambda$, and let $\tilde F$ and 
$\tilde R \in \mathbb{R}^{(k-1) \times k}$ be the first $k - 1$ rows of $F$ 
and $R$, respectively. Further, let 
$p^* := (p_0^*, \dots, p_{k-1}^*)^\top$, $q := (q_0, \dots, q_{k-1})^\top$, 
where $q_i$ is defined in Theorem 3.2, and let 
$P^* := \mathrm{diag}(p_0^*, \dots, p_{k-1}^*)$, 
$Q := \mathrm{diag}(q_0, \dots, q_{k-1})$. Note that 
$M(\xi^*) = F P^* F^\top$, $M(\xi) = M(\xi^*) + r (R - F Q F^\top)$. Hence 
(A.2) is equivalent to 

$(R - F Q F^\top) F^{-\top} P^{*-1} F^{-1} e_k = \gamma e_k$ for some $\gamma \in \mathbb{R}$. 
Thus, to prove the assertion it is sufficient to show 

(A.3) $0 = (\tilde R F^{-\top} S P^{*-1} S F^{-1} - \tilde F S Q P^{*-1} S F^{-1}) e_k = 2^{k-2} (\tilde R F^{-\top} S 1_k - \tilde F S q)$, 

where the last equation holds since Elfving's theorem ($e_k$-optimality of 
$\xi^*$ in $\Xi$) gives $2^{-(k-2)} e_k = F S p^*$. We define the following 
functions and matrices using Chebyshev polynomials: 

$h^{(1)}(x) = (T_0(x), \dots, T_{k-2}(x))^\top$, 
$h^{(2)}(x) = (T_{k-1}(x), \dots, T_{2k-3}(x))^\top$, 
$h(x) = (h^{(1)}(x)^\top, h^{(2)}(x)^\top)^\top$, 
$H^{(i)} = (h^{(i)}(x_0), \dots, h^{(i)}(x_{k-1})), \quad i = 1, 2$, 
$H = (h(x_0), \dots, h(x_{k-1}))$. 

With these definitions we can write 

$q = \frac{1}{2(k-1)} \mathrm{diag}(1/2, 1, 1, \dots, 1, 1/2) H^\top \int_{-1}^{1} h(x) v(x) \, dx$. 

Since $(-1)^{k-1-i} T_j(x_i) = T_{j+k-1}(x_i)$, we get by using problems 
1.5.28 and 1.1.3 in [19], 

(A.4) $H^{(1)} S q = \frac{1}{2(k-1)} H^{(2)} \mathrm{diag}(1/2, 1, 1, \dots, 1, 1/2) H^\top \int_{-1}^{1} h(x) v(x) \, dx$ 
$= \begin{pmatrix} 0 & \cdots & 0 & 0 & 1/2 & 0 & 0 & \cdots & 0 \\ 0 & \cdots & 0 & 1/4 & 0 & 1/4 & 0 & \cdots & 0 \\ \vdots & & & & \vdots & & & & \vdots \\ 0 & 1/4 & 0 & \cdots & 0 & \cdots & 0 & 0 & 1/4 \end{pmatrix} \int_{-1}^{1} h(x) v(x) \, dx$ 
$= \frac{1}{2} \int_{-1}^{1} h^{(1)}(x) T_{k-1}(x) v(x) \, dx$. 

Next we multiply (A.4) on the left side by the matrix which changes the 
basis from $T_0, \dots, T_{k-2}$ to $x^0, \dots, x^{k-2}$ and use Lemma A.2. 
Then we obtain $\tilde F S q = \tilde R F^{-\top} S 1_k$, implying that 
(A.3) is true. □ 